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Abstract. We study random nilpotent groups in the well-established style of random groups, by 
choosing relators uniformly among freely reduced words of (nearly) equal length and letting the 
length tend to infinity. Whereas random groups P = EVn/((R)) are quotients of a free group by 
such a random set of relators, random nilpotent groups are formed as corresponding quotients G = 
^s,m /({/?)) of a free nilpotent group. 

Using arithmetic uniformity for the random walk on and group-theoretic results relating a 
nilpotent group to its abelianization, we are able to deduce statements about the distribution of 
ranks for random nilpotent groups from the literature on random lattices and random matrices. 
We obtain results about the distribution of group orders for some finite-order cases as well as the 
probability that random nilpotent groups are abelian. For example, for balanced presentations 
(number of relators equal to number of generators), the probability that a random nilpotent group 
is abelian can be calculated for each rank m, and approaches 84.69...% as m ^ oo. Further, abelian 
implies cyclic in this setting (asymptotically almost surely). 

Considering the abelianization also yields the precise vanishing threshold for random nilpotent 
groups—the analog of the famous density one-half theorem for random groups. A random nilpotent 
group is trivial if and only if the corresponding random group is perfect, i.e., is equal to its commuta¬ 
tor subgroup, so this gives a precise threshold at which random groups are perfect. More generally, 
we describe how to lift results about random nilpotent groups to obtain information about the lower 
central series of standard random groups. 


1. Introduction and background 

1.1. Random groups. The background idea for the paper is the models of random groups T = 
An/((R)), where Fm is the free group on some number m of generators, and i? is a set of relators of 
length I chosen by a random process. Typically one takes the number of relators \R\ to be a function of 
i] for fixed £, there are finitely many choices of i? of a certain size, and they are all made equally likely. 
For instance, in the few-relators model, |i?| is a fixed constant, and in the standard density model, 
|i?| = (2m - l)'^^ for a density parameter 0 < d < 1. (When the number of relators has sub-exponential 
growth, this is often regarded as sitting in the density model at density zero.) 

After fixing \R\ as a function of £, we can write Pr(r has property P) = p to mean that the 
proportion of such presentations for which the group has P tends to p as £ ^ oo. In particular, we say 
that random groups have P asymptotically almost surely (a.a.s.) if the probability tends to 1. 

The central result in the study of random groups is the theorem of Gromov-Ollivier stating that 
for d > 1/2 in the density model, T is a.a.s. isomorphic to either {1} or Z/2Z (depending on the parity 
of £), while for d < 1/2, T is a.a.s. non-elementary hyperbolic and torsion-free [121 Thm 11]. In the 
rest of this paper, we will choose our relators from those of length £ and £ - 1 with equal probability 
in order to avoid the parity issue; with this convention, T = {1} a.a.s. for d > 1/2. 

The Gromov-Ollivier theorem tells us that the density threshold for trivializing a free group co¬ 
incides with the threshold for hyperbolicity, which means that one never sees other kinds of groups, 
for example abelian groups, in this model. Indeed, because 7? can not appear as a subgroup of a 
hyperbolic group, one never sees a group with even one pair of commuting elements. To be precise, all 
finitely-generated groups are quotients of Fm, but probability of getting a nontrivial, non-hyperbolic 
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group (or a group with torsion) is asymptotically zero at every density d + 1/2. Furthermore the 
recent paper [4] shows that this trivial/hyperbolic dichotomy seems to persist even at d= 1/2. 

However, it is a simple matter to create new models of random groups by starting with a different 
“seed” group in place of the free group Fm- The r random strings in {oi,... ,0^} that are taken as 
relators in the Gromov model can be interpreted as elements of any other group with m generators. 
For instance, forming random quotients of the free abelian group Z™ in this way would produce 
a model of random abelian groups; equivalently, the random groups arise as cokernels of random 
m X r integer matrices with columns given by the Gromov process, and these clearly recover the 
abelianizations of Gromov random groups. Random abelian groups are relatively well-studied, and 
information pertaining to their rank distribution can be found in at least three distinct places: the 
important paper of Dunfield-Thurston testing the virtual Haken conjecture through random models 
[3 §3-14]; the recent paper of Kravchenko-Mazur-Petrenko on generation of algebras by random 
elements [9]; and the preprint of Wang-Stanley on the Smith normal form distribution of random 
matrices HU. These papers use notions of random matrices that differ from the one induced by the 
Gromov model, but we will explain some of the distinctions below. By contrast, there are many other 
ways that random abelian groups arise in mathematics: as class groups of imaginary quadratic fields, 
for instance, or as cokernels of graph Laplacians for random graphs (also known as sandpile groups). 
For a discussion of heuristics for these various distributions and a useful survey of some of the random 
abelian group literature, see [IH] and its references. 

In this paper we initiate a study of random nilpotent groups by beginning with the free nilpotent 
group Ns^m of step s and rank in and adding random relators as above. Note that all nilpotent groups 
occur as quotients of appropriate Ng^m, just as all abelian groups are quotients of some Z™ and all 
groups are quotients of some Fm (here and throughout, groups are taken to be finitely generated). By 
construction, these free nilpotent groups can be thought of as “nilpotentizations” of Gromov random 
groups; their abelianizations will agree with those described in the last paragraph (cokernels of random 
matrices), but they have a nontrivial lower central series and therefore retain more information about 
the original random groups. 

Below, we begin to study the typical properties of random nilpotent groups. For instance, one 
would expect that the threshold for trivialization occurs with far fewer relators than for free groups, 
and also that nontrivial abelian quotients should occur with positive probability at some range of 
relator growth. 

The results of this paper are summarized as follows: 


• In the remainder of this section, we establish a sequence of group theory and linear algebra 
lemmas for the following parts. 

• In 121 the properties of Z™ random walk and its non-backtracking variant are described in 
order to deduce arithmetic statistics of Mal’cev coordinates. 

• We survey the existing results from which ranks of random abelian groups can be calculated; 
a theorem of Magnus guarantees that the rank of a nilpotent group equals the rank of its 
abelianization. (|3]) 

• We give a complete description of one-relator quotients of the Heisenberg group, and compute 
the orders of finite quotients with any number of relators. o 

• Using a Freiheitssatz for nilpotent groups, we study the consequences of rank drop, and 
conclude that abelian groups occur with probability zero for \R\ < m - 2, while they have 
positive probability for larger numbers of relators. Adding relators in a stochastic process 
drops the rank by at most one per new relator, with statistics for successive rank drop given 
by number-theoretic properties of the Mal’cev coordinates. m 

• We give a self-contained proof that a random nilpotent group is a.a.s. trivial exactly if \R\ is 
unbounded as a function of £. We show how information about the nilpotent quotient lifts to 
information about the LCS of a standard (Gromov) random group and observe that standard 
random groups are perfect under the same conditions. 
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• Finally, the last section records experimental data gathered in Sage for random quotients of 
the Heisenberg group, showing in particular the variety of non-isomorphic groups visible in 
this model of random nilpotent groups and indicating some of their group-theoretic properties. 

m 

1.2. Nilpotent groups and Mal’cev coordinates. Nilpotent groups are those for which nested 
commutators become trivial after a certain uniform depth. We will adopt the commutator convention 
that [a,6] = and define nested commutators on the left by [a,5,c] = [[a,5],c], [a,&,c,d] = 

[[[a,6],c],d], and so on. Within a group we will write [H,K] for the subgroup generated by all 
commutators [h, k] with h ranging over H < G and k ranging over K < G, so that in particular [G, G] 
is the usual commutator subgroup of G. A group is s-step nilpotent if all commutators with s + 1 
arguments are trivial, but not all those with s arguments are. (The step of nilpotency is also known as 
the class of nilpotency.) With this convention, a group is abelian if and only if it is one-step nilpotent. 
References for the basic theory of nilpotent groups are [m Ch 9], [21 Ch 10-12]. 

In the free group Fm of rank to, let 

~ { ; • ■ ■ ; ^ij ] • 1 < Xl, . . . , Zj < TO^ 

be the set of all nested commutators with j arguments ranging over the generators. Then the free 
s-step rank-m nilpotent group is 

A^s,m ~ Fjyi j ,..., am j ,... ] for all ij ), 

where ((i?)) denotes the normal closure of a set R when its ambient group is understood. Just as 
all finitely-generated groups are quotients of (finite-rank) free groups, all finitely-generated nilpotent 
groups are quotients of free nilpotent groups. Note that the standard Heisenberg group id(Z) = {a,b\ 
[a,5, a], [a, 6,5]) is realized as A’ 2 , 2 - In the Heisenberg group, we will use the notation c = [a,5], so 
that the center is (c). 

The lower central series (LCS) for a s-step nilpotent group G is a sequence of subgroups inductively 
defined by G^+i = [Gfe,G] which form a subnormal series 

{1} = G«+i<...<G3<G2<Gi=G. 

(The indexing is set up so that [Gi,Gj] c Gi+j.) For finitely generated nilpotent groups, this can 
always be refined to a polycyclic series 

{1} = GG„+i < GG„ < ... < GG 2 < GGi = G 

where each CGijGGi+i is cyclic, so either Z or Z/n^Z. The number of Z quotients in any polycyclic 
series for G is called the Hirsch length of G. From a polycyclic series we can form a generating 
set which supports a useful normal form for G. Make a choice of Ui in each GGi so that UiCGt+i 
generates CGijGGi+i. An inductive argument shows that the set {ui,... ,Un} generates G. We call 
such a choice a Mal’cev basis for G, and we filter it as MBi □••• uMBg, with MBj consisting of basis 
elements belonging to Gj \ Gj+i. Now if Ui e MB^-, let Ti be the smallest value such that uf' e MB^+i, 
putting Ti = 00 if no such power exists. Then the Mal’cev normal form in G is as follows: every element 
g 6 G has a unique expression as g = with integer exponents and 0 < ti < Ti ii Ti < 00 . Then 

the tuple of exponents (ti,... ,tn) gives a coordinate system on the group, called Mal’cev coordinates. 
We recall that MBj u • • • u MBg generates Gj for each j and that (by definition of s) the elements of 
MBs are central. 

We will denote a Mal’cev basis for free nilpotent groups Ns^m as follows: let MBi = {m,... ,am} 
be the basic generators, let MB 2 = {bij := [ 0 ^, 0 ^] '■ i < j} be the basic commutators, and take each 
MBj as a subset of Tj^m consisting of independent commutators from [MBj_i,MBi]. We note that 
IA 1 B 2 I = (™)j and more generally the orders are given by the necklace polynomials 

|MB,| = i^/r(d)TO^/^ 

J d\j 

where p, is the Mobius function (see |7l Thm 11.2.2]). 
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For example, the Heisenberg group H{h) = A^ 2,2 has the lower central series {1} <3 Z < with 

quotients Z and Z^, so its Hirsch length is 3. H{h) admits the Mal’cev basis a,b^c (with a = oi, 
6 = 02 , and c equal to their commutator), which supports a normal form g = . The Mal’cev 

coordinates of a group element are the triple {A,B,C) € Z^. 

1.3. Group theory and linear algebra lemmas. In the free group Fm = (oi,..., Om), for any freely 
reduced g e Fm, we define Ai{g), called the weight of generator Ui in the word g, to be the exponent sum 
of Oi in 5 . Note that weights Ai,, Am are well defined in the same way for the free nilpotent group 
Ns^m for any s. We will let ab be the abelianization map of a group, so that ah{Fm) = ab(A^s,m) = 
Under this isomorphism, we can identify ab(g) with the vector A{g) := {Ai{g ),..., Am{g)) e Z"*- If 
we have an automorphism (p on Ng^m, we write for the induced map on Z™, which by construction 
satisfies aho (j)= o ab. Note that A{g) is also the MBi part of the Mal’cev coordinates for g, and 
we can similarly define a &-weight vector B( 5 ) to be the MB 2 part, recording the exponents of the bij 
in the normal form. 

To fix terminology: the rank of any finitely-generated group will be the minimum size of any 
generating set. Note this is different from the dimension of an abelian group, which we define by 
dim(Z'^ X Go) = d for any finite group Gq. (With this terminology, the Hirsch length of a nilpotent 
group G is the sum of the dimensions of its LCS quotients.) In any finitely-generated group, we say an 
element is primitive if it belongs to some basis (i.e., a generating set of minimum size). For a vector 
w = {wi ,..., Wm) 6 Z™, we will write gcd(rc) to denote the gcd of the entries. So a vector w 6 Z™ is 
primitive iff gcd(rc) = 1. In this case we will say that the tuple (rci,... ,Wm) has the relatively prime 
property or is RP. As we will see below, an element g e Ng^m is primitive in that nilpotent group if and 
only if its abelianization is primitive in Z™, i.e., if A(g) is RP. In free groups, there exists a primitive 
element with the same abelianization as g iff A{g) is RP. 

The latter follows from a classic theorem of Nielsen m- 

Theorem 1 (Nielsen primitivity theorem). For every relatively prime pair of integers there 

is a unique conjugacy class [g] in the free group F 2 = {a,b) for which A(g) = i, B(g) = j, and g is 
primitive. 

Corollary 2 (Primitivity criterion in free groups). There exists a primitive element g e Fm with 
Ai(g) = Wi for j = 1 ,..., TO if and only if gcd(wi,..., Wm) = 1 - 

Proof. Let w = (wi ,..., Wm)- H gcd(w) t 1, then the image of any g with those weights would not be 
primitive in the abelianization Z™, so no such g is primitive in Fm. 

For the other direction we use induction on to, with the base case to = 2 established by Nielsen. Sup¬ 
pose there exists a primitive element of Fm-i with given weights wi,..., Wm-i- For S = gcd(wi,..., Wm-i 
we have gcd((5, Wm) = 1- Let w = ). By the inductive hypothesis, there exists an element 

g e Fm-i such that the weights oig are w, and g can be extended to a basis {g, ^ 2 ,..., hm-i} of Fm-i- 
Consider the free group {g,am) = F 2 . Since gcd{6,Wm) = 1, there exist g,h that generate this free 
group such that g has weights Ag{g) = 5 and Am{g) = Wm by Nielsen. Consequently, Ai{g) = Wi. 
Then {g, h, /i 2 , •••, hm-i) = {g, h 2 , ■. ■, hm-i, Om) = Fm, which shows that g is primitive, as desired. □ 

The criterion for primitivity in free nilpotent groups easily follows from a powerful theorem due to 
Magnus [101 Lem 5.9]. 

Theorem 3 (Magnus lifting theorem). If G is nilpotent and S' c G is any set of elements such that 
ab(S) generates ab(G), then S generates G. 

Note that this implies that if G is nilpotent of rank to, then G/((g)) has rank at least to - 1, because 
we can drop at most one dimension in the abelianization. 

Corollary 4 (Primitivity criterion in free nilpotent groups). An element g e Ng^m is primitive if and 
only if A(g) is primitive in Z™. 

Now we establish a sequence of lemmas for working with rank and primitivity. Recall that a, b are 
the basic generators of the Heisenberg group F[{'E) and that c= [a,&] is the central letter. 
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Lemma 5 (Heisenberg basis change). For any integers *, j, there is an automorphism (j) of = iV2,2 
such that where d = gcd(i, j) and m = ^(d -1) + k. 

In particular, if i,j are relatively prime, then there is an automorphism (j) of H{'E) such that 
= b. 

Proof. Suppose ri + sj = d = gcd(i, j) for integers r, s and consider d = a^b~'^, b = We compute 

= [a^ ■ [6“", = c. 

If we set c= c, we have [a, b] = c and [c, a] = [c, 6] = I, so (a, b) presents a quotient of the Heisenberg 
group. We need to check that it is the full group. Consider h = (a)“*^'^(&)®. Writing h in terms of 
a,b,c, the a-weight of /i is 0 and the ^-weight is {ri + sj)ld = 1, so /i = bd for some t. But then 
b = {d)~''l'^{bY{c)~^ and similarly a = (a)^^‘^(6)’'(c)“* for some t', so all of a,b,c can be expressed in 
terms of a, 6, c. 

Finally, 

{bf = = db>d^'^'^^, 

which gives the desired expression dVY = (bY{c)'^ from above. □ 


Proposition 6 (General basis change). Let 5 = gcd(Hi(g),..., Am{g)) for any g e H = Ng^rn- Then 
there is an automorphism (j) oi H such that 4>{g) = for some h 6 H 2 . 

Proof. Let Wi = Ai{g) for i = 1,...,m and let ri = Wijd, so that gcd(ri,... ,rm) = 1. By Corollary[2l 
there exists a primitive element x e Fm with weights ri. Let ^ be a change of basis automorphism of 
Fm such that <fi{x) = am- This induces an automorphism of PI, which we will also call if. 

By construction, and g have weight w. Since ab(x‘^) = ab( 5 ) = w, we must have (fY°{w) = 
ah{(j){x^)) = ab{(j){g)). Therefore (fix^) and (j){g) have the same weights. 

Then ab{(j){g)) = ah{(j){x^)) = ab((/)(a;)‘^) = ab(a^), so </>((;) and only differ by commutators, 
i.e., (j){g) = Yn'h for some h 6 F 12 . □ 

Remark 7. Given an abelian group G = 17^I{R), the classification of finitely-generated abelian groups 
provides that there are non-negative integers di,..., dm with dm\dm-i \. ■. |di such that G = '^Idih. 
If G has dimension q and rank r, then di = ■■■ = dq = 0, and dr+i = • • • = dm = I, so that 

G = 7P X {Tjjdq+ll X • • • X Ijdrlj) . 

Now consider a projection map / : Z"* -*• 17^ jK = "^Idil. We can choose a basis Ci,..., of 17^ 
so that 

m 

K — spanj^diCi,..., dm^m^ = diPj. 

i=l 

Then since every element in iF is a linear combination of {diCi,..., dm^m} and dm\dm-i\ ■ • ■ Mi, we 
have that dm divides all the coordinates of all the elements in K. Also dm^-m s K with being 
primitive. 

Lemma 8 (Criterion for existence of primitive vector). Consider a set of r vectors in Z™, and let d 
be the gcd of the rm coordinate entries. Then there exists a vector in the span such that the gcd of 
its entries is d, and this is minimal among all vectors in the span. 

In particular, a set of r vectors in Z™ has a primitive vector in its span if and only if the gcd of 
the rm coordinate entries is 1. 


Proof. With d as above, let K be the Z-span of the vectors and let 

7 := inf gcd(ui). 

w^K 

One direction is clear: every vector in the span has every coordinate divisible by d, so 7 > d. On the 
other hand dm^m e K and gcd(dmem) = dm because Cm is primitive. But dm is a common divisor of 
all rm coordinates, and d is the greatest such, so dm < d and thus 7 < d. □ 
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Lemma 9 (Killing a primitive element). Let H = Ng^m and let K be a normal subgroup of H. If 
rank(iJ/K) < m then K contains a primitive element. 

Proof. Since rank(iJ/K) < m, we also have rank(ab(i?/K)) < m. Writing ab(iJ/Kr) = 

as above, we have dm = 1- By the previous lemma there is a primitive element in the kernel of the 

projection ab(iJ) ->• ab{HlK), and any preimage in K is still primitive (see CorS]). □ 

Lemma 10 (Linear algebra lemma). Suppose ui,... ,Un e and suppose there exists a primitive 
vector V in their span. Then there exist V 2 ,. ■ ■ ,Vn such that span(u, V 2 ,. ■ ■, Vn) = span(iti,..., Un). 

Proof. Since v e span(ui,... ,u„), we can write v = aiUi + ••• + a„u„. Let a: e Z" be the vector with 
coordinates at. Because gcd(u) = 1, we have gcd(ai) = 1, so a: is primitive. Thus, we can complete x 



' -V- '' 


' -X- '' 


' -Ui- '' 


' -X- '' 

to a basis of Z", say {x,X 2 ^. ■. ,Xn}. Then take 

-V2- 

= 

-X2- 


-U2- 

. Since 

-X2- 


\ -'Vn- J 


1 -Xn- / 




\ —Xn— j 


SLn{Z), it represents a change of basis matrix, so we have span(w,ti 2 ,•••,u„) = span(ui,•••,u„), as 
needed. □ 

Lemma 11 (String arithmetic). Fix a free group F = Fm on m generators and let R,S be arbitrary 
subsets, with normal closures ((i?)),{(S')). Let (f : F ^ Fl{{R)) and tp : F ->■ F/{{S)) be the quotient 
homomorphisms. Then there exist canonical isomorphisms 

iF/({R)))/U{S))) = F/({RuS)) = iF/({S)))/mR))) 

that are compatible with the underlying presentation (i.e., the projections from F commute with these 
isomorphisms). 

Proof. We will abuse notation by writing strings from F and interpreting them in the various quotients 
we are considering. Then if G = (F | T) = F/{{T} is a quotient of F and 17 is a subset of F, we can 
write (G I U} to mean Fj{{T u U]) and can equally well write {F \ T,U). Then the isomorphisms we 
need just record the fact that 

{F\R,S) = {F/{{R))\S) = {F/{{S))\R). □ 

Because of this standard abuse of notation where we will variously interpret a string in {oi,..., Om}* 
as belonging to F^, fVs.mi or some other quotient group, we will use the symbol =g to denote equality 
in the group G when trying to emphasize the appropriate ambient group. 

2. Random walk and arithmetic uniformity 

In this section we record some properties of the simple nearest-neighbor random walk (SRW) and 
the non-backtracking random walk (NBSRW) on the integer lattice Z™, then deduce consequences 
for the distribution of Mal’cev coordinates for random relators in free nilpotent groups. (See the 
Appendix for additional details [3].) For the standard basis {ci} of Z™, SRW is defined by giving the 
steps iCi equal probability I/2m, and NBSRW is similarly defined but with the added condition that 
the step ±ei cannot be immediately followed by the step Ta (that is, a step can’t undo the immediately 
previous step; equivalently, the position after k steps cannot equal the position after k + 2 steps). A 
random string wi of £ letters from {oi,..., Om}* has the form wi = aia 2 ---ai, where the at are i.i.d. 
random variables which equal each basic generator or its inverse with equal probability I/2m. The 
abelianization Xg = A{w() is a Z^-valued random variable corresponding to Fstep SRW. A random 
freely reduced string does not have an expression as a product of variables identically distributed 
under the same law, but if vi is such a string, its weight vector = A{vi) is another Z^-valued 
random variable, this time corresponding to NBSRW. 

It is well known that the distribution of endpoints for a simple random walk in Z™ converges to 
a multivariate Gaussian: if Xi is again the random variable recording the endpoint after £ steps of 
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simple random walk on Z"*, and St is the dilation in sending v tv, we have the central limit 
theorem: 

Vi v m 7 

This convergence notation for a vector-valued random variable V{, and a multivariate normal S) 
means that V( converges in distribution to AW + fi, where the vector /i is the mean, S = AA^ is the 
covariance matrix, and kb is a vector-valued random variable with i.i.d. entries drawn from a standard 
(univariate) Gaussian distribution A/’(0,1). In other words, this central limit theorem tells us that 
the individual entries of Xi are asymptotically independent, Gaussian random variables with mean 
zero and expected magnitude \/£/to. This is a special case of a much more general result of Wehn 
for Lie groups and can be found for instance in [TJ Thm 1.3]. Fitzner and van der Hofstad derived a 
corresponding central limit theorem for NBSRW in [6]. Letting Yi be the Z'^-valued random variable 
for Gstep NBSRW as before, they find that for m>2, 

Note that the difference between the two statements records something intuitive: the non-backtracking 
walk still has mean zero, but the rule causes the expected size of the coordinates to be slightly higher 
than in the simple case; also, it blows up (as it should) in the case m = 1. 

The setting of nilpotent groups is also well studied. To state the central limit theorem for free 
nilpotent groups, we take St to be the similarity which scales each coordinate from MB^ by , so that 
for instance in the Heisenberg group, St{x,y,z) = {tx,ty,t^z). 

Proposition 12 (Distribution of Mal’cev coordinates). Suppose NB^ is an iVs „j-valued random 
variable chosen by non-backtracking simple random walk (NBSRW) on {ai,...,am}* for £ steps. 
Then the distribution on the Mal’cev coordinates is asymptotically normal: 

NBf ~ A/'(0,E). 

Vi 

For SRW, this is called a “simple corollary” of Wehn’s theorem in (TJ Thm 3.11]), where the only 
hypotheses are that the steps of the random walk are i.i.d. under a probability measure on X that 
is centered, with finite second moment (in this case, the measure has finite support, so all moments are 
finite). Each Mal’cev coordinate is given by a polynomial formula in the a-weights of the variables at 
(the polynomial for an MBj coordinate has degree j—for instance in H{Z) = iV 2,2 the coordinate C 
is a quadratic in A(ai)). The number of summands in the polynomial gets large as £ -»■ oo. Switching 
to NBSRW, it is still the case that NB^ is a product of group elements whose a-weight vectors are 
independent and normally distributed, so their images under the same polynomials will be normally 
distributed as well, with only the covariance differing from the SRW case. We sketch a simple and 
self-contained argument for this in the N 2^2 non-backtracking case—that the third Mal’cev coordinate 
in i?(Z) is normally distributed— which we note is easily generalizable to the other Ng^m with (only) 
considerable notational suffering. Without loss of generality, the sample path of the random walk is 

g = ... a*’’Id’" 

for some integers is At summing to £ or £ - 1, with all but possibly ii and jr nonzero. After a certain 
number of steps, suppose the last letter so far was a. Then the next letter is either a, b, or with 
equal probability, so there is a 1/3 chance of repeating the same letter and a 2/3 chance of switching. 
This means that the is and jt are (asymptotically independent) run-lengths of heads for a biased coin 
(Bernoulli trial) which lands heads with probability 1/3. On the other hand, r is half the number of 
tails flipped by that coin in £ (or £ - 1) trials. In Mal’cev normal form, 

g— 

Thus the exponent of c is obtained by adding products of run-lengths together (Q times, and general 
central limit theorems ensure that adding many independent and identically distributed (i.i.d.) random 
variables together tends to a normal distribution. 


CORDES DUCHIN DUONG HO SANCHEZ 


Our distribution statement has a particularly nice formulation in this Heisenberg case, where the 
third Mal’cev coordinate records the signed area enclosed between the x-axis and the path traced out 
by a word in {a, 6}*. For instance, in the figure below, baba encloses area -3, which equals the c 
exponent in the normal form. 



Corollary 13 (Area interpretation for Heisenberg case). For the simple random walk on the plane, 
the signed area enclosed by the path is a normally distributed random variable. 

Next, we want to describe the effect of a group automorphism on the distribution of coordinates. 
Then we conclude this section by considering the distribution of coordinates in various Z/pZ. 

Corollary 14 (Distributions induced by automorphisms). If (j) is an automorphism of A^s,m and g is 
a random freely reduced word of length £ in {ui,..., Om}*, then the Mal’cev coordinates of ah{(j){g)) 
are also normally distributed. 

Proof. The automorphism </> induces a change of basis on the copy of Z™ in the MBi coordinates, 
which is given by left-multiplication by a matrix B € Then -> A/'(0,□ 

Note that normality of the MBj coordinates follows as well, as before: they are still described by 
sums of statistics coming from asymptotically independent Bernoulli trials, and only the coin bias has 
changed. 

Relative primality of MBi coefficients turns out to be the key to studying the rank of quotient 
groups, so we will need some arithmetic lemmas. 

Lemma 15 (Arithmetic uniformity). Let be the Z-valued random variable given by the a^-weight 
of a random word of length I in {oi,..., Om}*, for 1 < j < m. Let equal Ai^i with probability ^ 
and A(,-\^i with probability |. Then Ve > 0 3ci,C2 > 0 s.t. 

Vfc, PT(A(i = k mod n) <—. 

' ’ ' n 

More generally 3ci, C 2 > 0 s.t. for any s<m and distinct ii,..., is, 

\/ki,...,ks, Pr(A^,„ = fci,--, e mod n) < ^ + . 

V. '' n® 

In other words, the Z/nZ-valued random variables induced by the coordinate projections from 
random walk on the Mal’cev generators MBi approach independent uniform distributions. 

Proof. Depending on whether the random word is chosen as a random string or a random freely 
reduced string, A(^i is the Bh coordinate projection of either the SRW or the NBSRW Yg, and 
is the parity-corrected version. We first consider the residues mod n for simple random walk Xi on 
Z"* by studying its position on the discrete torus {ZlnZ)"^. We only need to consider the statement 
for s coordinates in the case s = m, since the results for s < m can be derived from this by summing: 
for instance, the positions satisfying 7Ti{X£) = ki for j = 1,2 are represented by positions in the 

torus, so the bound can be added polynomially many times to get the right main term, at the cost of 
slightly enlarging the constant C 2 . 

A theorem from Saloff-Coste [m Theorem 7.8] controls the distance from a lazy symmetric gener¬ 
ating random walk to the uniform distribution on any family of finite Cayley graphs which satisfies 
a uniform doubling bound on volume growth. (In our case, the growth ffBr is bounded on (Z/nZ)"* 
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by (2r + I)'", independent of n, and the graphs satisfy the doubling hypothesis.) First we will explain 
how this theorem provides the needed bound, then we will explain how to modify our random walk 
to satisfy the theorem’s hypotheses. 

In our notation, the theorem says that 

Pr(7ri(X^) E ki,---,7rm{Xe) = km mod n) < ^ + ^^^^2 , 

for the following reasons: the distance upper-bounds the difference in probabilities at any single 
point, and the diameter of (Z/nZ)™ is less than mn. Since n < we have and by 

enlarging the constants we obtain 

Pr(7ri(Xe) E ki,--,7rm(X£) E km mod n) < ^ + , 

as desired. 

In order to use this theorem on our (non-lazy) walk, we apply the following technique: we replace 
the simple random walk P with the two-step walk P * P which is lazy and symmetric. If n is odd, 
the support of P x- P is a generating set, and we can proceed. If n is even, P x P is supported on the 
sublattice of torus points where the sum of the coordinates is even, which does not generate. But in 
that case the random variable that we are studying (which takes either £ or £-1 steps) lives on the 
even or odd sublattice with equal probability; Saloff-Coste’s statement will ensure equidistribution on 
the even sublattice, and by symmetry, taking one more step will equidistribute on the odds. (To be 
precise, we should use ^/2 rather than £ on the right-hand side because of the parity fix, but this gets 
absorbed in the constants.) 

To handle NBSRW, we can construct a new state space whose states correspond to directed edges 
on the discrete torus; this encodes the one step of memory required to avoid backtracking. This 
new state space can itself be rendered as a homogeneous finite graph, and a similar argument can be 
applied. □ 

Corollary 16 (Uniformity mod p). The abelianization of a random freely reduced word in Fm has 
entries that are asymptotically uniformly distributed in Z/pZ for each prime p, and the distribution 
mod p is asymptotically independent of the distribution mod q for any distinct primes p, q. 

Proof. For independence, consider n = pq in the previous Lemma. Asymptotically uniform implies 
asymptotically independent. □ 

Corollary 17 (Probability of primitivity). For a random freely reduced word in Fm, the probability 
that it is primitive in abelianization tends to l/<('(m), where C is the Riemann zeta function. In 
particular, for m = 2, the probability is G/tt^. 

Proof. Using arithmetic uniformity, one derives a probability expression that agrees with 1 /^(to) by 
Euler’s product formula for the zeta function, as in [8]. For details, see the Appendix [3]. □ 

Remark 18 (Comparison of random models). As we have seen, abelianizations of Gromov random 
groups are computed as cokernels of random matrices M whose columns are given by non-backtracking 
simple random walk on Z"*. Most other models in the random abelian groups literature use somewhat 
different randomization set-ups. Dunfield and Thurston [5] use a lazy random walk: £ letters are 
chosen uniformly from the (2 to + 1) possibilities of a* and the identity letter, creating a word of length 
< £, whose abelianization becomes a column of M. Results by Kravchenko-Mazur-Petrenko [9] and 
Wang-Stanley m use the standard “box” model: integer entries are drawn uniformly at random 
from [-£,£], and asymptotics are calculated as £ ->■ oo. (This is the most classical way to randomize 
integers in number theory; see [H].) 

However, the main arguments in each of these settings rely on arithmetic uniformity of coordinates 
mod p to calculate probabilities of relative primality, which is why the Riemann zeta function comes 
up repeatedly in the calculations. 
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3. Preliminary facts about random nilpotent groups via abelianization 


In this section we make a few observations relevant to the model of random nilpotent groups we 
study below. In particular, there has been substantial work on quotients of free abelian groups Z™ by 
random lattices, so it is important to understand the relationship between a random nilpotent group 
and its abelianization. Below, and throughout the paper, recall that probabilities are asymptotic as 
I -> oo. 

First, we record the simple observation that depth in the LCS is respected by homomorphisms. 

Lemma 19. Let (j): G ^ H he a. surjective group homomorphism. Then (j){Gk) = Hk where Gfc, Hk 
are the level-fc subgroups in the respective lower central series. 

Proof. Since </) is a homomorphism, depth-fc commutators are mapped to depth-fc commutators, i.e., 
ipiGk) £ Hk- Let h e Hk- Without loss of generality we can assume /i is a single nested commutator h = 
[wi,Wk]. By surjectivity of (j) we can choose lifts wi, ..., Wk oiwi,... ,Wk. We see [wT,..., uifc] 6 Gk 
and (l){Gk) 2 Hk- □ 


To begin the consideration of ranks of random nilpotent groups, note that the Magnus lifting 
theorem (Theorem[3|) tells us the rank of Ns^rn/{{R)) equals the rank of its abelianization 7/^l{R), so 
we quickly deduce the probability of rank drop. 

Proposition 20 (Rank drop). For a random r-relator nilpotent group G = Ns^mlUgi, ■ ■ ■ ,gr)), 

Pr(rank(G) <m) = . 

C{rm) 

Proof. This follows directly from considering the existence of a primitive element in (ab(i?)). By 
Lemma IHl this occurs if and only if the rm entries are relatively prime, and by arithmetic uniformity 
fLemma lI5|) . this is computed by the Riemann zeta function, as in Corollary 1171 □ 


Next we observe that a nilpotent group is trivial if and only if its abelianization (i.e., the corre¬ 
sponding Z™ quotient) is trivial, and more generally it is finite if and only if the abelianization is 
finite. Equivalence of triviality follows directly from the Magnus lifting theorem (Theorem[3]). For the 
other claim, suppose the abelianization is finite. Then powers of all the images of at are trivial in the 
abelianization, so in the nilpotent group G there are finite powers off in the commutator subgroup G 2 . 
A simple inductive argument shows that every element of Gj has a finite power in Gj+i; for example, 
consider hij e G 2 . Since [a[Saj] = is a commutator of elements from G 2 and Gi, it must be in G 3 , 
as claimed. But then we can see that there are only finitely many distinct elements in the group by 
considering the Mal’cev normal form 

g = u\u 2 ...u* 

and noting that each exponent can take only finitely many values. Since the rank of a nilpotent group 
equals that of its abelianization (by Theorem [3] again), it is also true that a nilpotent group is cyclic 
if and only if its abelianization is cyclic. 

We introduce the term balanced for groups presented with the number of relators equal to the 
number of generators, so that it applies to models of random groups Fml{{R)), random nilpotent 
groups Ns^m/{{R)), or random abelian groups I/^I{R), where |i?| = m, the rank of the seed group. We 
will correspondingly use the terms nearly-balanced for |i?| = m - I, and underbalanced or overbalanced 
in the |i?| < m - 1 and |i?| > m cases, respectively. 

Then it is very easy to see that nearly-balanced (and thus underbalanced) groups are a.a.s. infinite, 
while balanced (and thus overbalanced) groups are a.a.s. finite because m random integer vectors (in 
any of the models) are R-linearly independent with probability one. However, is also easy to see that 
if |i?| is held constant, no matter how large, then there is a nonzero probability that the group is 
nontrivial (because, for example, all the a-weights could be even). 

To set up the statement of the next lemma, let Z{m) := C(2)... C{m) and 


P{m) 


n 

primes p \ ^ / 
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As in Remark [T51 we can quote the distribution results of [S],[S],[T7] because of the common feature 
of arithmetic uniformity. 

Lemma 21 (Cyclic quotients of abelian groups). The probability that the quotient of Z™ by m - 1 
random vectors is cyclic is 1/Z(m). With m random vectors, the probability is P{m)IZ{m). 

These facts, particularly the first, can readily be derived “by hand,” but can also be computed 
using Dunfield-Thurston as follows: their generating functions give expressions for the probability 
that i random vectors with Z/pZ entries generate a subgroup of rank j, and the product over primes of 
the probability that the Z/pZ reduction has rank > m -1 produces the probability of a cyclic quotient 
over Z. 

The latter fact appears directly in Wang-Stanley m as Theorem 4.9(i). We note that correspond¬ 
ing facts for higher-rank quotients could also be derived from either of these two papers, but the 
expressions have successively less succinct forms. 

Corollary 22 (Explicit probabilities for cyclic quotients). For balanced and nearly-balanced presen¬ 
tations, the probability that a random abelian group or a random nilpotent group is cyclic is a strictly 
decreasing function of m which converges as m ->• oo. 

In the balanced case, the limiting value is a well-known number-theoretic invariant. Values are 
estimated in the table below. 

The convergence for both cases is proved in [m Thm 4.9] as a corollary of the more general 
statement about the Smith normal form of a random not-necessarily-square matrix M, which is an 
expression A = SMT for invertible S,T in which A has all zero entries except possibly its diagonal 
entries an = ai. These ai are then the abelian invariants for the quotient of Z™ by the column span 
of M (that is, they are the di from RemarkObut with opposite indexing, di = am+i-i)- The rank of 
the quotient is the number of these that are not equal to 1. 

The probabilities of cyclic groups among balanced and nearly-balanced quotients of free abelian 
groups and therefore also for random nilpotent groups are approximated below. Values in the table 
are truncated (not rounded) at four digits. 


Pr( cyclic) 

m = 2 

m = 3 

m = 4 

TO = 10 

TO = 100 

TO = 1000 

m ^ oo 

|i?j = m - 1 

.6079 

.5057 

.4672 

.4361 

.4357 

.4357 

.4357 

\R\ = m 

.9239 

.8842 

.8651 

.8469 

.8469 

.8469 

.8469 

Computing the probability of 

a trivial quotient with r 

relators is 

equivalent to the the probability 


that r random vectors generate Z"*. 

Lemma 23 (Explicit probability of trivial quotients). For r > m, 

This is a rephrasing of [9l Cor 3.6] and [171 Thm 4.8]. 

Remark 24. From the description of Smith normal form, we get a symmetry in r and m, namely 

Pr (rank (Z™/^ = m - fc) = Pr (rank (27(?;i, ... ,v„,)) = ^ - ^) Vl<fc< min(r, m) 

just by the observation that the transpose of the normal form expression has the same invariants. For 
example, applying duality to Lemma [2T] and reindexing, we immediately obtain, as in Lemma 1231 

p (Z™ / - ^ _ 

z{m+l) ~ C(2)-C(m-Hl)' 
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|i?| = 123456789 10 


rank(G') = 3 
2 
1 
0 

|i?| = l 2 3 4 5 6 7 8 9 10 



rank(G) = 4 
3 
2 
1 
0 




|i?| = l 2 3 4 5 6 7 8 9 10 


rank(G) = 5 



|i?| = 123456789 10 


rank(G) = 6 



|i?|=l 2 3 4 5 6 7 8 9 10 



|i?|=l 2 3 4 5 6 7 8 9 10 



|i?|=l 2 3 4 5 6 7 8 9 10 



|i?|=l 2 3 4 5 6 7 8 9 101112131415161718192021222324252627282930 


Figure 1. The empirical distribution of ranks in for m= 2,3,4,5,6,7,8,15. 
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4. Quotients of the Heisenberg group 

We will classify all G ■■= H{Z)l{{g} for single relators g, up to isomorphism. As above, we write a, b 
for the generators of H(Z), and c = [a, 6 ]. With this notation, H(Z) can be written as a semidirect 
product X Z via ( 6 , c) x (a) with the action of Z on Z^ given by ba = abc~^, ca = ac. 

Theorem 25 (Classification of one-relator Heisenberg quotients). Suppose g = + 1. Let d = 

gcd(j,j), let m = - 1) + fc as in Lemma [5l and let D = gcd(d, m). Then 

G-.= H{Z)I,. if* = j=0; 

Gig)) |(Z/^Z X Z/DZ) xZ, else, 

with the convention that Z/OZ = Z and Z/IZ = {!}. In particular, G is abelian if and only if 5 = 
or gcd(i, j) = 1; otherwise, it has step two. Furthermore, unless 5 is a power of c (the i = j = 0 case), 
the quotient group is virtually cyclic. 

Note that this theorem is exact, not probabilistic. 

Remark 26 (Baumslag-Solitar case). The Baumslag-Solitar groups are a famous class of groups given 
by the presentations BS{p,q) = {a,b\ a¥'a~^ = b’^) for various p,q. For the Heisenberg quotients as 
described above, we will refer to U = 1 as the Baumslag-Solitar case, because in that case sd-tm = l 
has solutions in s,t, and one easily checks that the group is presented as 

G={a,b\ [a,6] = &“, b^^ = 1) = + td)/ 

a 1-relator quotient of a solvable Baumslag-Solitar group BS{l,q). 

Examples: 

(1) if (? = a, then G = 1. 

(2) if g = c, then G = 1?. 

(3) if (? = c^, then G = (Z x Z/2Z) x Z. 

(4) if 5 = we have d = 4, m = 226, D = 2, so we get 

(5) a g = a^b'^c^, we have d = 2, m = 3, D = 1. In this case, b^ =c (? =g 1 and the quotient group 
is isomorphic to Z/4Z x Z with the action given by aba~^ = b^. This is a two-step-nilpotent 
quotient of the Baumslag-Solitar group BS{1,3) by introducing the relation b'^ = 1. 

We see that the quotient group G collapses down to Z precisely if gcd(i, j) = 1. Namely, c=g 1 in 
that case, so we have a quotient of Z^ by a primitive vector. 

Corollary 27. For one-relator quotients of the Heisenberg group, G = A^ 2 , 2 /(( 5 )), 

Pr(G = Z) = « 60.8% ; Pr(G step 2, rank 2) = 1 - 4-. 

TT^ TT^ 

Of course, if g = c, we have 1?, but this event occurs with probability zero. If gcd(i,j) + 1, then G 
is two-step (thus non-abelian) and has torsion. 

Proof of theorem. First, the {i,j) = (OjO) case is very straightforward: then g = and the desired 
expression for G follows. 

Below, we assume {i,j) (OjO), and by LemmaEl without loss of generality, we will write g = b‘^c^. 
Consider the normal closure of b, which is (( 6 )) = ( 6 ,c). This intersects trivially with (a), and 
G={(b)){a). ThusG=(&,c>x(a). 

Now in H{'L), we compute (( 5 )) = c ( 6 ,c). Thus 
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We have the semidirect product structure G = j d where the action sends (J(}) 

and fixes (°). Note that c has order d in G, and a simple calculation verifies that b has order d^fD, 
where D = gcd{d,m). If we are willing to lose track of the action and just write the group up to 
isomorphism, then we can perform both row and column operations on [ ^ ^ ] to get ^ j, which 

produces the desired expression. □ 


In fact, we can say something about quotients of H(X) with arbitrary numbers of relators. First 
let us define the K-factor K{R) of a relator set R = {gi,... ,gr}, where relator gi has the Mal’cev 

coordinates and similarly for g 2 ,...,gr- Let M = ( *.^ I and suppose its 

\ Ji ■ Jr / 

nullity (the dimension of its kernel) is q. Then let IF be a kernel matrix of M, i.e., an r x g matrix 
with rank q such that MW = 0. (Note that if i? is a random relator set, then q = r - 2, since the rank 
of M is 2 with probability one.) Let k = (fci,... , kr) be the vector of c-coordinates of relators, so that 
kW € Z"?. Then K{R) := gcd(fclF) is defined to be the gcd of those q integers. 


Theorem 28 (Orders of Heisenberg quotients). Consider the group G = iL(Z)/((gi,..., ^r)), where re¬ 
lator gi has the Mal’cev coordinates {ii,ji,ki), and similarly for g 2 , - ■ ■ ,gr- Let d = gcd(Ii, jd ■ ■ ■ ,ir,jr)] 
let A be the co-area of the lattice spanned by the (*“ ) in Z^; and let K = K{R) be the iiT-factor 
defined above. Then c has order 7 = gcd(d, K) in G and |G| = A • 7 . 


Proof. Clearly A is the order of ab(G) = Gl{c). So to compute the order of G, we just need to show 
that the order of c in G is 7 . Consider for which n we can have c" 6 ([gi,... ,gr)), i.e., 

N 

n ET Cq, -1 

C = 1 1 Wa 5a 

cx.=l 


for arbitrary words Wa and integers Ca- First note that all commutators \w,ga\ are of this form, and 
that by letting w = a ov b, these commutators can equal or cJ°‘ for any a, so n can be an arbitrary 
multiple of d. 

Next, consider the expression in full generality and note that A(c") = (q). Conjugation pre¬ 
serves weights, so A{wag^wf^^) = A{gfj‘) = eo,A{ga) = Ca (}“ )• To get the two sides to be equal in 
abelianization, the Ca must record a linear dependency in the (}“ )• Finally we compute 


n— ^^Ca(^aj/a Uaiof) '^aja 

a a<P a 


f) 


^aka 


where (y^) = We can observe that each of the first three terms is a multiple of d and the 

fourth term is an arbitrary integer multiple of K. (To see this, note that the column span of W is 
exactly the space of linear dependencies in the A{ga), so ff^aka is a scalar product of the k vector 
with something in that column span, and is therefore a multiple of K.) Thus n can be any integer 
combination of d and K, as we needed to prove. □ 


We will include experimental data about the distribution of random Heisenberg quotients in Sec¬ 
tion [71 


5. Rank drop 

First, we establish that adding a single relator to a (sufficiently complicated) free nilpotent group 
does not drop the nilpotency class; the rank drops by one if the relator is primitive in abelianization 
and it stays the same otherwise. Furthermore, a single relator never drops the step unless the starting 
rank was two. This is a nilpotent version of Magnus’s famous Freiheitssatz (freeness theorem) for free 
groups [ini Thm 4.10]. 

Theorem 29 (Nilpotent Freiheitssatz). For any g 6 Ng^m with s > 2,m > 3, there is an injective 
homomorphism 

Ns,m-1 ^ ^s,m/{{g))- 
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This is an isomorphism if and only if gcd(Ai( 5 ),..., Ajn{g)) = 1- 
If m = 2 the result holds with Z ^ Ns^ 2 /({g])- 

Proof. Romanovskii’s 1971 theorem [Ml Thm 1] does most of this. In our language, the theorem says 
that if Am{g) + 0, then (oi,..., Om-i) is a copy of Ns^m-i- This establishes the needed injection except 
in the case g € [A^s,m,where A(g) is the zero vector. In the m = 2 case, any such Ns^ 2 /{{g} 
has abelianization Z^, so the statement holds. For m > 2, one can apply an automorphism so that g 
is spelled with only commutators involving am- Even killing all such commutators does not drop the 
nilpotency class because m> 2 ensures that there are some Mal’cev generators spelled without Om in 
each level. Thus in this case (oi,..., Om-i) = Ng^m-i still embeds. 

It is easy to see that if g is non-primitive in abelianization, then the rank of ah{Ns^mHg))) is m, 
and so the quotient nilpotent group has rank m as well. However, the image of Romanovskii’s map 
has rank to - 1 , so it is not a surjection. 

On the other hand, suppose ah^g) is a primitive vector. Then the rank of the abelianized quotient 
is TO - 1, and by Magnus’s theorem (Theorem [3]) the rank of the nilpotent quotient is the same. The 
group G = Ns^ml{{g} is therefore realizable as a quotient of that copy of Ns^m-i- Since the lower central 
series of Ns^m-i has all free abelian quotients, any proper quotient would have smaller Hirsch length, 
and this contradicts Romanovskii’s injection. Thus relative primality implies that the injection is an 
isomorphism. □ 

Now we can use rank drop to analyze the probability of an abelian quotient for a free nilpotent 
group in the underbalanced, nearly balanced, and balanced cases (i.e., cases with the number of 
relators at most the rank). 

Lemma 30 (Abelian implies rank drop for up to to relators). Let G = Ns^m/{{R)), where R = 
{gi,. ■ ■, gr} is a set of r < to random relators. Suppose s > 2 and my 2. Then 

Pr(G' abelian | rank(G') = to) = 0. 

Proof. Suppose that rank(G') = to and G is abelian. We use the form of the classification of abelian 
groups (Remark [7]) in which G = ©(O^Z/diZ, where dm | | di so that di = • • • = d, = 0 for 9 = dim(G), 

and we write {(ab(i?))) = (dici,..., dm^m) for a basis {a} of Z™. Since rank(G) = to, we can assume 
no di = 1. We can lift the basis {a} of Z"* to a generating set {ui} of Ns^m by Magnus (Theorem[3]). 
Note that the exponent of each generator in each relator is a multiple of dm- 

Next we show that we cannot kill a commutator in G without dropping rank. Let bi = [ 01 , 0 ^]. 
We claim that bi ^ ([gi,... ,gr))- To do so, we compute an arbitrary element 

n 

Ylwagl^W~^ € ((gi,...,g^)). 

OL 

Conjugation preserves weights, so A{wag^°w~^) = A(( 7 ^“) = eaA{gi^). If the product is equal to bi, 
then its a-weights are all zero. Now consider the 6 -weights. For the product, the 6 -weights are the 
combination of the 6 -weights of the ga, modified by amounts created by commutation. However, since 
all the a-exponents of all the ga are multiples of dm, we get 

^e*A(g,) = I? j , ^eiB(g,) E j (mod dm), 

where each et is the sum of the corresponding to gi- The second expression ensures that the et are 
not all zero, so the first equality is a linear dependence in the A(gi), which has probability zero since 
r < m- □ 

Theorem 31. (Underbalanced quotients are not abelian) Let G = Ns^m/{{R}, where R = {gi ,... ,gr} 
is a set of r < to - 2 random relators gi- Then 

Pr(G abelian) = 0. 
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Proof. Suppose that G is abelian, and consider elements of G as vectors in Z"^ via the abelianiza- 
tion map on Ng^rn] in this way we get vectors vi = A{gi),... ,Vr = A(gr). From the previous result 
we may assume rank(G') < m. By Lemma |51 we can find a primitive vector in as a linear combi¬ 
nation of the Vi. Then we apply the linear algebra lemma (Lemma 1101) to extend w appropriately 
so that span(i;i,..., Ur) = spaii(w,W 2 , ■ ■ ■ ,Wr). We can find a series of elementary row operations 
(switching, multiplication by -1, or addition) to get (w,W 2 , ■ ■ ■ ,'Wr) from (ui,...,Ur), and we lift 
these operations to elementary Nielsen transformations (switching, inverse, or multiplication, respec¬ 
tively) in Ns^m to get {g', g 2 , ■ ■ ■, 9r) from {gi,... ,gr). Note that Nielsen transformations on a set 
of group elements preserve the subgroup they generate, so also preserve normal closure. This lets 
us define R' = {g',g 2 , ■ ■ ■ ,gr} with {{R'} = {{R}- Since g' has a weight vector w whose coordinates 
are relatively prime, the Freiheitssatz (Theorem [221) ensures that Ng ml{{g')) = Ng m-i- Thus we have 
G = Ng,^.J{{g!2,...,g'J. 

If r < TO - 2, then iterating this argument r - 1 times gives G = Ng^m-r+il{{gr)) for some new gr, and 
TO - r -I- 1 > 3. Then we can apply Theorem 1291 to conclude that this quotient is not abelian, because 
its nilpotency class is s > 1. □ 

Proposition 32 (Cyclic quotients). If |i?| = to - 1 or |i?| = to, then abelian implies cyclic: 

Pr(G cyclic | G abelian) = 1. 

Proof. Running the proof as above, we iterate the reduction to - 2 times to obtain G = Ng^ 2 /{{g)) or 
Ng/l{{g,g')). 

If g (or any element of {(g,g'))) is primitive, then G is isomorphic to Z or a quotient of Z, i.e., G is 
cyclic. 

Otherwise, note that N := Ng ^2 has the Heisenberg group as a quotient {H{Z) = N 1 /N 3 ). If G is 
abelian, then the corresponding quotient of H(lf) is abelian. In the non-primitive case, this can only 
occur if c 6 ((g, g')), which (as in the proof of Lemma [30l) implies A(g) = (0,0) (or a linear dependency 
between A{g) and A{g'y). But by Corollary [HI the changes of basis do not affect the probability of 
linear dependency, so this has probability zero. □ 

Corollary 33. For nearly-balanced and balanced models, the probability that a random nilpotent 
group is abelian equals the probability that it is cyclic. 

We reprise the table from recalling that values are truncated at four digits. 


Pr(abelian) 

TO = 2 

TO = 3 

TO = 4 

TO = 10 

TO = 100 

TO = 1000 

m ^ 00 

|ii| = TO - 1 
i?| = TO 

.6079 

.9239 

.5057 

.8842 

.4672 

.8651 

.4361 

.8469 

.4357 

.8469 

.4357 

.8469 

.4357 

.8469 


Corollary 34 (Abelian one-relator). For any step s > 2, 

PT:{Ng^rnlig} is abelian) = 

Note that these last two statements agree for to = 2, |i?| = to - 1 = 1. 

6. Trivializing and perfecting random groups 

In this final section, we first observe the low threshold for collapse of a random nilpotent group, 
using the abelianization. Then we will prove a statement lifting facts about random nilpotent group 
to facts about the LCS of classical random groups, deducing that random groups are perfect with the 
same threshold again. 

Recall that Tj^m = {[aq,. ■., ] : 1 < H,..., < to} contains the basic nested commutators with j 

arguments. In this section we fix to and write F for the free group, so we can write Fi for the groups 
in its lower central series. Similarly we write N for W m (when s is understood), and T,- for T, „,. 
Note that {{Tj )) = F^, so N = F/Fg^i. 


G/tt^, to = 2 
0, TO > 3. 
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For a random relator set R c F, we write F = F/{{R}, G = N/{{R}, and H = Z'"/(i?) = ab(r) = 
ab(G), using the abuse of notation from Lemma [TT] and treating i? as a set of strings from F to be 
identified with its image in N or Z™. In all cases, R is chosen uniformly from words of length £ or 
i-linF. 

First we need a result describing the divisibility properties of the determinants of matrices whose 
columns record the coordinates of random relators. 

Lemma 35 (Common divisors of random determinants). Fixing m and any k > 10m, let = 
gcd(A^_i,..., Ai^k) be the greatest common divisor of the determinants of k random mx m matrices 
all of whose columns are independently sampled from Ai. Then, as £ oo, 

To prove this carefully requires dividing the primes into size ranges and verifying that only the 
small primes {p < log log £) contribute. See the Appendix [3] for details. 

The following theorem tells us that in sharp contrast to Gromov random groups, where the number 
of relators required to trivialize the group is exponential in £, even the slowest-growing unbounded 
functions, like log log log £ or an inverse Ackermann function, suffice to collapse random abelian groups 
and random nilpotent groups. 

Theorem 36 (Collapsing abelian quotients). For random abelian groups H = 17^I{R) with |i?| random 
relators, if |i?| ^ oo as a function of £, then H = {0} with probability one (a.a.s.). If |i?| is bounded 
as a function of £, then there is a positive probability of a nontrivial quotient, both for each £ and 
asymptotically. 


Pr(df ^ = 1) 


n 1 




Proof. For a relator g of length £, its image in Z”^ is the random vector A(g), which converges in 
distribution to a multivariate normal, as described in ^ Furthermore, the image of this vector in 
projection to Z/pZ has entries that are asymptotically independently and uniformly distributed. We 
will consider adding vectors to this collection R until they span Z’", which suffices to get H = {0}. 

Choose m vectors vi,...Vm in Z™ at random. These vectors are a.a.s. R-linearly independent, 
because their distribution is normal and linear dependence is a codimension-one condition. Therefore 
they span a sublattice Li c Z’". The covolume of Li (i.e., the volume of the fundamental domain) is 
= det(ui,... ,Vm)- As we add more vectors, we refine the lattice. Note that = I if and only 
if Li = 17^. Similarly define Lj to be spanned by 'y(j-i)m+ii ■ • ■ i '^’jm for j = 2,3,..., and define A^j to 
be the corresponding covolumes. 

Note that for two lattices L,L', the covolume of the lattice L u L' is always a common divisor of 
the respective covolumes A, A'. Therefore, the lattice Li u - • • uL^, has covolume < gcd(A^^i,..., Ai^k)- 
From Lemma 1551 this gcd approaches 


n 1- 


I- 


p)( p^) ( p™ 



as £ ->• oo, and this in turn goes to 1 as fc ^ oo. (To see this, note that first applying a logarithm, then 
exchanging the sum and the limit, gives an absolutely convergent sequence.) 

On the other hand, it is immediate that for any finite |i?| there is a small but nonzero chance that 
all entries are even, say, which would produce a nontrivial quotient group. □ 


Of course this also follows immediately from the statement in Lemma because 
PT{span{vi,... ,Vr} = = — - 1 I 

for any fixed m as r ->• oo. 

We immediately get corresponding statements for random nilpotent groups and standard random 
groups. Recall that a group F is called perfect if F = [F,r]; equivalently, if ab(r) = r/[F,r] = {0}. 
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Corollary 37 (Threshold for collapsing random nilpotent groups). A random nilpotent group G = 
Ns,ml{{R)) is a.a.s. trivial precisely in those models for which |i?| ^ oo as a function of £. 

Corollary 38 (Random groups are perfect). Random groups T = Fjn/({R} are a.a.s. perfect precisely 
in those models for which \R\ cxj as a function of i. 

Proof. 7 j"^I{R) = {0} ab(r) = {0} ab(G) = {0} G = {!}, with the last equivalence 
from Theorem [3 □ 

We have established that the collapse to triviality of a random nilpotent group G corresponds to 
the immediate stabilization of the lower central series of the corresponding standard random group: 
Ti = r2 = ... In fact, we can be somewhat more detailed about the relationship between G and the 
LCS of r. 

Theorem 39 (Lifting to random groups). For T = Fml{{R)) and G = Ns^m/{{R}, they are related by 
the isomorphism T/T^j+i = G. Furthermore, the first s of the successive LCS quotients of F are the 
same as those in the LCS of G, i.e.. 


Fi/Fi+i ^ Gi/Gi+i forl<i<s. 

Proof. Since homomorphisms respect LCS depth ILemma fTOl) . the quotient map (p : F ^ T gives 
(j){Fj) = Fj for all j. We have 


F/r,^i = F/({R, Fs^i)) = N/({R)) = G 
by Lemma [TT] (string arithmetic). 

From the quotient map : F ^ G, we get F^/Fg+i = ipiTi) = Gi. Thus 

GG^ =Fj/Fs+i/jn jY = Tj/p _ Q 
' Gri+l 'ii+l/is+1 'ii+1 

Corollary 40 (Step drop implies LCS stabilization). For G = Ng^mlUR}, if step(G) = k < s = 
step(A^s.m)) then the LCS of the random group F stabilizes: Ffe+i = Tk +2 = .... 

Proof. This follows directly from the previous result, since step(G) = k implies that G^+i = Gk +2 = 1, 
which means Gk+ilGk +2 = 1- Since fc + 1 < s, we conclude that r/c+i/Ffc +2 = 1. Thus Ffe +2 = F^+i, and 
it follows by the definition of LCS that these also equal F^ for all i > fc + 1. □ 

Thus, in particular, when a random nilpotent group (with m > 2) is abelian but not trivial, the 
corresponding standard random group has its lower central series stabilize after one proper step: 

... F 4 = F 3 = F 2 < Fi = F 

For instance, with balanced quotients of F2 this happens about 92% of the time. 

In future work, we hope to further study the distribution of steps for random nilpotent groups. 


7. Experiments 

7.1. Multi-relator Heisenberg quotients. The following table records the outcomes of 10,000 
trials (1000 trials for each of the ten rows) with relators of length 999 and 1000, for G = iL(Z)/((i?)). 
Note that \R\ = 2 is the balanced case. 
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|G| = 1 
(trivial) 

G cyclic nontrivial 
(rk 1 ) 

G abelian noncyclic 
(rk 2 step 1 ) 

G nonabelian 
(rk 2 step 2 ) 

largest hnite order 



infinite 

finite 

infinite 

finite 

infinite 

finite 

— 

Ii?l = l 

0 

604 

0 

0 

0 

396 

0 

— 

2 

1 

1 

917 

0 

0 

0 

81 

11178 

3 

514 

0 

467 

0 

9 

0 

10 

717 

4 

766 

0 

228 

0 

2 

0 

4 

104 

5 

884 

0 

116 

0 

0 

0 

0 

7 

6 

945 

0 

55 

0 

0 

0 

0 

4 

7 

979 

0 

21 

0 

0 

0 

0 

3 

8 

981 

0 

19 

0 

0 

0 

0 

3 

9 

995 

0 

5 

0 

0 

0 

0 

2 

10 

997 

0 

3 

0 

0 

0 

0 

2 


Note that the triviality column comports closely with the probability of collapse described in ^ 


Pr(span{i;i,...,z;J = Z2) = —- , 

C(r- 1 )-C(r) 

which predicts 0,0,506,769,891,948,975,988,994, and 997 trivial quotients. 

7.2. Finite nonabelian quotients of balanced presentations. Because underbalanced (|i?| < 
m - 2) and nearly-balanced (|i?| = m - 1) presentations necessarily produce infinite groups, while the 
overbalanced case (|i?| > m +1) often collapses the group, balanced presentations are a good source for 
finite nonabelian quotients, as one sees in the table above. Consider balanced quotients of H{'L) for 
which the random relators have Mal’cev coordinates (ii, ji, fci) and (* 2 , jb, ^ 2 ). Letting A = |iij 2 -* 2 ji|, 
the group is finite if and only if A > 0, in which case the order of the abelianization is A. Letting 
d = gcd(ii, ji, * 2 , 72 ), we recall that d = 1 implies a cyclic quotient, so the finite nonabelian case requires 
A > 0 and d> 1. Having A > 0 implies that there are no nontrivial linear dependencies between ( b 'j 
and (), so K{R) = 0 (as in Theorem [28)l . making the order of c in the quotient group equal to d; 
since |(c)| = d, the quotient is nonabelian iff d > 1. Finally |G| = A ■ d, and we further note that d^ | A, 
so d^ divides the order of the group. This means that the smallest possible orders of nonabelian 
quotients are 8,16,24,27,32,40,48,54,... 

With small-order groups, one can easily classify by isomorphism type, asking for instance how many 
of the order-eight nonabelian groups are isomorphic to the quaternion group Q = {±1, ±i, ±j, ±k} and 
how many to the dihedral group ZI 4 . However, since the expected magnitude of each of the entries in 
( ) and () is ^/^, the value of A and hence the expected size of these quotient groups is growing 

fast with £. Therefore to illustrate the distribution of random nilpotent groups that are small-order 
nonabelian, we consider n = 10,000 trials with f = 9 or 10. In this sample, 562 of the quotients were 
finite nonabelian. 


order 

8 

16 

24 

27 

32 

40 

48 

54 

56 

64 

72 

80 

81 

88 

96 

125 

216 

frequency 

130 

138 

65 

41 

45 

32 

35 

24 

9 

18 

9 

3 

6 

1 

2 

2 

2 


Of the 130 groups in this sample of order eight, 33 were isomorphic to Q, and the other 97 to £> 4 . 


7.3. One-relator Heisenberg quotients. Finally, we use Lemma [5] and Theorem [25] to study the 
diversity of random infinite groups appearing in the one-relator case. Given a random relator whose 
Mal’cev coordinates are (i,j, fc), we first change variables as in the Lemma to obtain coordinates 
(Ojd, m), where d = gcd{i,j) and m = ^{d- 1) + k. In this presentation, as noted in the proof of the 
Theorem, ord(c) = d and ord( 6 ) = d^fD, while a has infinite order. Thus every word involving a has 
infinite order; on the other hand, b and c commute and so any word in those letters alone has order 
at most d^/D (note that this is divisible by d because D = gcd( fi,m)). Extracting information that is 
independent of presentation, we conclude that the order of the center is d and the largest order of a 
torsion element is d^/D. 
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We ran n = 20,000 trials with ^ = 999 or 1000 and plotted the frequency of each {(f‘lD,d) pair 
(Figure[2]). Besides the groups that are pictured, there were also four occurrences of (j, j) = (0,0) in the 
sample, with k values 55,187,230,580, that are not pictured. Because groups with distinct {(PjD^d) 
pairs must be non-isomorphic, our sample contains at least 202 distinct groups (up to isomorphism). 

Recall from Remark [H] that groups with D = 1 are called Baumslag-Solitar type because they are 
isomorphic quotients of some BS{\,q). But D = gcd{d,m), and from the expression m = ^{d- 1) +k 
we can note that 2d | ij if either i or j is even, in which case gcd(d, m) = gcd{i,j, k). (If both are odd, 
the situation splits into sub-cases depending on the 2-adic valuations.) This suggests heuristically 
that non-cyclic groups of Baumslag-Solitar type should occur with probability = .22398... 

The precise frequency of groups of this type in the sample was 4404, or 22.02%. 


130 

120 

no 

100 

90 

80 

70 

60 

50 

40 

30 

20 

10 

0 

-10 

A sem 

y non- 

the cv 

= BS{ 



largest torsion order {(fjD) 


ilog plot of {(PiD, d) in 20,000 random 1-relator quotients of the Heisenberg group with relator length 999 or 1000, showing at least 
isomorphic groups. Variously sized disks represent the number of occurrences of each d) value. Since D\d, all possibilities 

irves {d,d) and {d^,d). Of these random groups, 61% are isomorphic to Z and an additional 22% are of Baumslag-Solitar type 
l,q)/{{g} as in Remark!^, and thus lie along the lower curve {d?,d). 







22 


CORDES DUCHIN DUONG HO SANCHEZ 


Funding. This work was initiated in the Random Groups Research Cluster held at Tufts University 
in Summer 2014, supported by the National Science Foundation [DMS-CAREER-1255442]. 

Acknowledgments. The co-authors had many stimulating and useful conversations in the course 
of the work, including with Phil Matchett Wood, Nathan Dunfield, Dylan Thurston, Ilya Kapovich, 
Anschel Schaffer-Cohen, Rick Kenyon, and Larry Guth. Special thanks to Keith Conrad and to 
Hannah Alpert for several insightful observations, and to two anonymous referees for close reading and 
very helpful suggestions. Finally, Melanie Matchett Wood pointed out several issues in the preprint 
that merited more detail and care, which led to the creation of an appendix. We are extremely grateful 
for the feedback. 


References 

[1] E. Breuillard, Random walks on Lie groups. http://www.math.u-psud.fr/~breuilla/partOgb.pdf 

[2] Drutu and Kapovich, Lectures on geometric group theory. Preprint. 

[3] M. Duchin, M. Ho, A. Sanchez, Appendix to Random Nilpotent Groups I. https://arxiv.org/abs/1506.01426 

[4] M. Duchin, K. Jankiewicz, S. Kilmer, S. Lelievre, J. Mackay, and A. Sanchez, A sharper threshold for random groups 
at density one-half. Groups, Geometry, and Dynamics, Vol 10, Issue 3, 2016, 985-1005. 

[5] N. Dunfield and W. Thurston, Finite covers of random 3-manifolds. Invent. Math. 166 (2006), no. 3, 457-521. 

[6] R. Fitzner and R. van der Hofstad, Non-backtracking random walk. Journal of Statistical Physics 150(2): 264-284, 
(2013). 

[7] Marshall Hall, Jr., The theory of groups. Second edition. American Mathematical Society, 1976. 

[8] Hardy and Wright, An introduction to the theory of numbers. Sixth edition. Oxford University Press, 2008. 

[9] R. Kravchenko, M. Mazur, and B. Petrenko, On the smallest number of generators and the probability of generating 
an algebra. Algebra Number Theory 6 (2012), no. 2, 243-291. 

[10] Wilhelm Magnus, Abraham Karrass, Donald Solitar, Combinatorial group theory. Presentations of groups in terms 
of generators and relations. Second edition. Dover, 1975. 

[11] J. Nielsen, Die Isomorphismen der allqemeinen, unendlichen Gruppe mit zwei Erzeuqenden. (German) Math. Ann. 
78 (1917), no. 1, 385-397. 

[12] Yann Ollivier, A January 2005 Invitation to Random Groups, Ensaios Matematicos [Mathematical Surveys], vol. 
10, Sociedade Brasileira de Matematica, Rio de Janeiro, 2005, 31, 85—86. 

[13] Derek J.S. Robinson, A course in the theory of groups, 2nd ed.. Springer-Verlag, 1996. 

[14] N.S. Romanovskii, A freedom theorem for groups with one defining relation in the varieties of solvable and nilpotent 
groups on given lengths. (English translation.) Math. USSR-Sb. 18 (1972), 93-99. 

[15] L. Saloff-Goste, Random walks on finite groups. Probability on discrete structures, 263-346, Encyclopaedia Math. 
Sci., 110, Springer, Berlin, 2004. 

[16] Charles C. Sims, Computation with finitely presented groups. Encyclopedia of Mathematics and its Applications, 
48. Cambridge University Press, Cambridge, 1994. 

[17] Yinghui Wang and Richard P. Stanley, The Smith normal form distribution of a random integer matrix. 
https: //arxiv.org/abs/1506.00160 

[18] Melanie Matchett Wood, Random integral matrices and the Cohen Lenstra Heuristics. 
https: //arxiv.org/abs/1504.04391 


RANDOM NILPOTENT GROUPS I 


23 


Appendix A. Expanded information on arithmetic properties of random walks 

Moon Duchin, Meng-Che Ho, and Andrew Sanchez 

This appendix fills in details for some claims given above about arithmetic properties of random 
walks. We will focus here on the simple random walk (SRW) on Z™, where the arguments will be 
spelled out in full, but also give indications of how to extend these arguments to non-backtracking 
simple random walk (NBSRW). We will use the notation At for the Z'^-valued random walk, as above, 
and discuss the SRW and the NBSRW case separately in each argument below. 

If Et is an event that depends on a parameter £, we use the symbol Pr(A^) for the probability for 
fixed I and write Pr(E^) := lim Pr(E^) for the asymptotic probability. If E is an event with respect 

to a matrix-valued random variable, we use the notation Pi'{E) to denote the conditional probability 
of E given that no matrix entries are zero. 

We will analyze primes by their size relative to £, so we fix a small e (say 0 < e < and define size 
ranges: 

Vi:={p<\og\oge} V 2 -={\og\og£<p<£^-A V 3 ■= {£^~" < P < V 4 ■= {p > ■ 

Recall that Lemma ITK] provides that 3ci,C 2 > 0 s.t.Vn < £i~’^, VI < s < m, Vfci,..., 

Pi:(At i^=ki,---,Ai^i^=ks mod n] < — + . 

Here we will carefully establish the following two results from above. 

Corollary 17 (Probability of primitivity). For a random freely reduced word in E^, the probability 
that it is primitive in abelianization tends to 1 /<('(to), where is the Riemann zeta function. In 
particular, for m = 2, the probability is G/tt^. 

Lemma 35 (Common divisors of random determinants). Fixing m and any k > 10m, let = 
gcd(Af_i,..., At^k) be the greatest common divisor of the determinants of k random mx m matrices 
all of whose columns are independently sampled from At. Then, as £ 00 , 

Probability of primitivity. For SRW, proceeds like a lazy simple random walk on Z: at each 
step, it advances left or right with probability l/2m, and otherwise it stands still. A similar statement 
is true for NBSRW, but the probabilities depend on the previous step. As mentioned above, classical 
central limit theorems tell us that At^i is asymptotically normally distributed, and this is true for 
the NBSRW case as well [6]. In this appendix we will sometimes use information about the rate 
of convergence of At^i to the Gaussian distribution. For SRW, we have local central limit theorems 
(LCLT) which give upper bounds on the difference between the probability that At^i = x and the 
estimate derived from the Gaussian, in terms of x and £ (see for instance Lawler-Limic, Random 
Walk, A Modern Introduction, Chapter 2). For NBSRW, this is a folklore result that has not yet been 
written down, as far as we know. 

Lemma A.l (Divisibility of coordinate projections). For every m,n> 2, 1 < s <m, and £ » I, there 
is a conditional probability bound given by 

Pr'^At.! = ■•■ = Ai^s = 0 mod n) < 1/n®. 

In particular, Pr(At^i = 0 mod n | At^i 0) < Ijn for any fixed i. 

Proof. We give the detailed argument for s = 1. Let pt(,x) = Pr(A^^i = x). This result will follow from 
monotonicity of the distribution of At^i, i.e., pt{x) > pt{x + I) for x > 0. We proceed by induction on 


Pr(df ^ = 1) 


n 1- 







24 


CORDES DUCHIN DUONG HO SANCHEZ 


l. For ^ = 1, we have pi(0) = ^ and pi(l) = which establishes the base case. For £ > 1, we have 

2^ 

Pti^) = + - Pi-iix) + —p^_i(x + 1); 

2 m m 2m 

jYi — X X 

P£{x + 1) = — pi . i { x ) + - pi - i{x + l) + — pg . i{x + 2). 

2 m m 2m 

Now we know that > tA (since m > 2), and this means 

m 2m ' '' ’ 

777 — 1 1 1 777 — 1 

- Pi - i { x ) + — p £- i{x + 1) > — Pi - i { x ) +- P£_i(x + 1), 

m 2m 2m m 

since the LHS has a larger coefficient on the larger term. This compares two of the terms oipi { x ) with 
two of the terms of pi{x + 1), so it only remains to compare the remaining terms. Since x > 0, we have 
|x -1| < x + 1. Thus, by repeatedly applying the inductive hypothesis, we have p^_i(x- 1) > pi - i{x + 2), 
which completes the proof for all In particular, we have shown: if the positive integers Z>o are 
partitioned into intervals \kn + l,A:n + n], then the farthest point in each interval from 0 (the value 
divisible by n) has the lowest probability. 

For NBSRW, we would need to inspect the LCLT bounds to establish monotonicity rigorously, 
though it is intuitively clear for £ » 1. 

The argument for general s runs along exactly the same lines: is cut up into n x ■■■ x n boxes 

which are obtained as products of the intervals described above, then in each box, the point farthest 
from the origin (which satisfies the congruence condition in the statement of the lemma) has the lowest 
probability in the random walk. □ 


Lemma A.2 (Values of coordinate projections). There is a constant c 
any i and e > 0, 


Pr(4f.i 


a) 


< for £ » 1. 

XI 


such that for any a e Z and 


Proof. Bounding Pr(A^ ^ = a) by a multiple of £ follows from the standard local central limit 
theorem for SRW and could be extended to NBSRW from its LCLT. □ 


With this, we can establish the probability that a random relator is primitive in abelianization. 
Lemma A.3 (Corollary 17). Let Sg be the greatest common divisor of the entries of Ag^i. Then 

U(5,>1) = 1-^. 

C(m) 

Proof. Recall that for an event expressed in terms of a matrix-valued random variable, Pr^(£') denotes 
the conditional probability of E given that the entries of the matrix are nonzero (and this definition 
makes sense for vectors in particular). Since 

Pr((5^ > 1) ^ Pr'(some prime divides Sg) + Pr(some entry of Ag^i is zero), 

we have 

Pr(some p^Vi divides 5g) < Pr{5g > 1) < Pr'(some p^Vi divides 5g) + Pr'(some p divides Sg) 

+ Pr(some entry is zero). 

Recall the (well-known) fact that the product of all primes up to some N is asymptotically e^; this 

implies that H p < £^~'^. Thus we can apply Lemma [15] with n = to get asymptotic uniformity 

piVi Vi 

(and independence) for all of these primes at once. From this we get Pr^(some p eVi divides 5g) -> 
1 - , via the Euler product formula for the zeta function. By Lemma lA.ll 

Pr'(some p^V? divides 5g) < V — 

nriTfl 

piVi P 


0, 
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where the inequality is just the sum-bound Pr(Ui Ei) < Pr(£li) and it converges to zero as the tail 
of a convergent sequence. Lastly, Pr(some entry is zero) ^ 0 and the lemma follows. □ 

Common divisors of random determinants. We now build up a series of lemmas regarding 
divisibility with respect to our partition of the primes of determinants of random matrices Mi with 
columns independently distributed by Ai. We will refer to the upper left-hand kxk minor of such a 
matrix by (for 1 < fc < m). 

Lemma A. 4 (Divisibility of determinants by small primes). Let 

There exist constants Ci,C 2 > 0 such that for all £, k, and p < (i.e., p^Vi UP 2 ), 

I Pr(p| cf-p) - [Pm(p)]^ I < 

Furthermore, 

Pr(no p e Pi divides d^^^) = 0 “ [Pm,(p)]^) + . 

Pi 

Proof. The number of nonsingular matrices with Fp entries is 

\GLM\ = {p^ - 1) (p- -p)...(p--p-i) 

out of p"* total matrices [T^ , so the ratio of singular matrices is Pm (p). Thus the lemma follows 
from the fact that each entry of Mi approaches a uniform distribution with the error term decaying 
exponentially fast in Summing the error over the kmf entries appearing in k mx m matrices only 
worsens the constant C 2 that appeared in Lemma [T51 

For the second statement we use the fact, noted in the last proof, that probabilities are asymptot¬ 
ically uniform/independent for all primes in Vi- The Chinese Remainder Theorem ensures that for 
any mx m matrices A with entries in Z/pZ and B with entries in Z/gZ, there is a unique matrix C 
with entries in Z/pgZ that agrees with both in the respective projections. Using this repeatedly, with 
n = ri p, we count that the number of matrices over Z/nZ such that no p € Pi divides the determinant 

Pi 

must equal O-Pi IGLmClFp)!- The statement follows. □ 

To get a similar bound for large primes, we prove two lemmas on the divisibility of the determinants 
of the submatrices M^ , and then combine them for a bound that works on P 3 and P 4 . 

Lemma A. 5 (Divisibility of determinants by large primes). For e as above, there is a constant c such 
that for sufficiently large £, any 1 < k <m, and any prime p > (i.e., p e P 3 u P 4 ), we have 

Pr'(det e 0 mod p) < —r^ + 

where Pr^ denotes conditional probability given that the matrix entries are nonzero. It follows that 
there is a constant c such that Pr'(p | Ag^i) < for p 6 P 3 u P 4 , and Pr'(p | Ai^i) < cp^^ for 

pe P 3 . 

Proof. For fixed m, we start with the k = 1 case and raise k one increment at a time to show that the 
probability that M^^^ is divisible by p is p • When fc = 1, this follows from Lemma fA.ll 

Now suppose this is true for Mf '. Introduce the equivalence relation A B Oy = bij for all 

(fc,fc); that is, declare two kxk matrices equivalent if they agree in all entries except possibly 
the bottom right. Then there is a constant Cm for each matrix M such that 

det A = Ofcfc det A + Cm VA6[M], 

where N is the upper-left-hand (A: - 1) x (fc - 1) minor. Now if p -f det N, then solving for Okk gives 
(det A - Cm) (det A)“^ mod p, so at most 1/p of the auk values in Z give a possible solution. Thus 
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there are at most jp) + 1 matrices A e [M] with determinant divisible by p in this case, and 

since < p this has a conditional probability at most — + 1);^ < given that the 

matrix falls in the equivalence class. (The estimate comes from multiplying the number of matrices 
by the probability upper-bound for each matrix; this bound is subject to an exponentially decaying 
error because the independence is only asymptotic, but that is dominated by the \fi.) 

By the hypothesis, the probability that detiV is divisible by p is < thus 

Pr'(detM}''^ E 0 mod p) < + ^ + + ^ = -r i. After enlarging c, 

( k) 

the first statement follows for . For the last statement, we want to combine these three terms. 
Since p > we first observe that - < jq^ < and clearly as well. Note that if 


p < then t > pm+i, and we are done. 


□ 


Lemma A.6 (Nonsingularity). Pr(A = 0) = 0. 

Proof. The idea is that determinant zero is a codimension-one condition. To show it rigorously, we 
prove the following stronger result: for fixed m, we will show that Pr(detMj^^ = 0) = 0 for each 
1 < k < m. For fc = 1, we note that so the statement follows from Lemma [4.21 Let’s show 

that if it is true for then it is true for M = Let us write qt to denote the lower-right 

entry of and pf, to denote the list of the other -1 entries {Miq ,..., Mk^k-i)- The induction 
hypothesis tells us the probability that det N = 0 tends to zero for N the upper left-hand k - 1 x k - 1 
minor. Assuming that minor is nonsingular, there is exactly one value of making det AQ ' = 0 for 
each p; call it q{p.). But, recalling that 0 is the most likely value for q^ and that the different pi = p 
are disjoint events, we have 

Pr(detMj'‘^ = 0) = = qip)) < = 0) = Piiqi = 0). 

But qi is distributed like so by Lemma IAT21 this tends to zero. □ 


Lemma A.7 (Lemma 35). Fixing m and any k > 10m, we have 


Pr(dfUl)= n 1 





Proof. We’ll break down the probability by dividing the primes into the size ranges Vi, V 2 , T’s, and 
7^4- As above, let Pm(p) := 1 - (l - ••• (l - and note that Pm(p) < ^ because there 

are at most 2"* nonzero terms with denominators at least p. We clearly have the following bounds: 

Pr(p|(i^^^ for some p 6 Vi) < > 1) < Pr'(p|d^^^ for some p € Vi) + Pr'(p|d^^^ for some p 6 P2) 

-t- Pi {p\d‘f'’ for some p e P3) -1- Pr'(p|d^^^ for some p 6 Vi) 
+ Pr(some entry is zero). 

We apply Lemma IA.4I and take a limit to get 

Pr(p I 4^Uor some pe Pi) = 1 - U (1 - [Pm(p)]*)+0(e”^ )—^1- 0 (l “ [Pm(p)]^) • 

7^1 primes p 

We have thus shown that Pr(d^^^ >!)>!- n (1 - [Pm(p)]^^, which implies that 

primes p 

U(df = l)< n (l-[Prn(p)]'=). 

primes p 

Note that Pr' conditions on an event whose probability tends to 1, thus lim^^oo Pr'(A) = Pr(£') if the 
limits exist. 
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To finish the theorem we must show the other four terms that bound Pr((i^^^ > 1) limit to zero, 
starting with the primes in 7 ^ 2 - We have 

Y'Y{p\df^ iov sova.ep^V 2 ) <Y,^"^Wf^) = Y.{^rn{v) + 0{e~^^")\ —> 0, 

V2 V2 

where the Pm(p) term appears because p < means we can apply Lemma lA.41 To justify the 
convergence to zero, recall that Pm{p) < ^ and k> 2 . 

We now handle the case of P 3 , applying Lemma IXSl (and recalling that k > 10m and e < to get 

Pr'(p I for some p e V 3 ) < ^Pr'(p | ^ (Pr'(p | A^)^) < ^c-p 2 m+ 2 ^ < ^ 

V 3 Vi Vi Vi P 

Since the sum over all primes of p~^ converges, this tail certainly converges to zero as ^ ^ 00 . 

In the range Va, since all coordinates of the random walk vector are < £, we have | A^| < m\(™ < 
for £ » 1. Since A^ = 0 is an asymptotically negligible event fLemma lA. 6 l) . we have 

Pr'(p|d^^^ for some p e Va) —*• 0 , 

because Pr(p|d^^^ for some p e Va) = Pr(A £4 = ■•• = = 0) —> 0, so Pr = Pr'. Finally, the 

probability of a zero entry also goes to zero (Lemma IA.2I) . which completes the proof. □ 









